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Abstract: We investigate the constructions of tail-biting trellises for linear block codes 
introduced by Koetter/Vardy [9j and Nori/Shankar |15j . For a given code we will define 
the sets of characteristic generators more generally than in [9j and we will investigate how 
^ the choice of characteristic generators affects the set of resulting product trellises, called 

^ KV-trellises. Furthermore, we will show that each KV-trellis is a BCJR-trellis, defined in a 

slightly stronger sense than in [T5] , and that the latter are always non-mergeable. Finally, we 
will address a duality conjecture of Koetter/Vardy by making use of a dualization technique 
of BCJR-trellises and prove the conjecture for minimal trellises. 
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During the last decade conventional and tail-biting trellis representations of linear block codes 
have gained a great deal of attention because both types of trellises do not only reveal code 
structure but also may give rise to efficient decoding algorithms of Viterbi type. 

As a consequence, the quest is on for constructing trellises with low complexity with respect 
to various measures. For conventional trellises it is by now well-known that for a given linear 
block code (and with a fixed coordinate ordering) there exists a unique minimal trellis and 
minimality coincides with biproperness as well as with non-mergeability; see jHl [T2l U] . This 
unique minimal trellis simultaneously minimizes the vertex count at each time as well as 
any other conceivable complexity measure. Several methods exist to construct this minimal 
trellis, the BCJR-construction [UllS], the Forney-construction based in past-future induced 
realizations [1] and others. For an excellent overview on conventional trellises we refer to 
Vardy's survey [18]. 

For tail-biting trellises the situation is considerably more difficult. Due to an additional 
degree of freedom (the cardinality of the vertex space at time 0) minimal tail-biting trellises 
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are not unique and this applies to minimality with respect to various measures, e. g., the 
state complexity profile, the maximum state dimension, the sum of the state dimensions as 
well as to all these notions where we replace the state spaces by the edge spaces, see, for 
instance, [2119]. Moreover, minimality, non-mergeability, and biproperness are not equivalent 
for tail-biting trellises. The fact, however, that the complexity of a tail-biting trellis may be 
much lower than that of the minimal conventional trellis raised a great deal of interest in 
these trellises and, in particular, in the question how to efficiently construct and to analyze 
tail-biting trellises; see also |17| [TT] in addition to the aforementioned references. 

A major breakthrough has been made by Koetter/Vardy in the excellent paper [9]. Therein, 
Koetter/Vardy presented a construction from which all minimal tail-biting trellises can be 
derived. Their construction accounts for all cyclic shifts of the code and results in a list 
of characteristic generators (codewords associated with spans), which then comprise what 
they call the characteristic matrix. It can be shown that each minimal trellis (with respect 
to any of the complexity measures mentioned above) is a KV-trellis, that is, arises as the 
product of the elementary trellises associated with characteristic generators. However, some 
care needs to be exercised at this point because not all minimal trellises of the given code 
can be obtained from the same list of characteristic generators. We will present an example 



later on in 3.15 Furthermore, as observed in [9], not all KV-trellises are minimal trellises. 
But, as we will see later on, they are always non-mergeable. 

A different construction of tail-biting trellises has been introduced by Nori/Shankar in [15J. 
Their construction generalizes the well-known BCJR-construction of conventional trellises. 
The additional degree of freedom, the choice of the vertex space at time 0, is made explicit 
by a displacement matrix which can be regarded as capturing the circular past of the various 
trajectories (cycles) through the trellis. In the conventional case this past is simply zero. 
While the displacement matrix can be chosen arbitrarily, the best trellises arise when the 
matrix is based on the choice of spans for the given basis of the code. Later on we will 
restrict ourselves mainly to those displacement matrices. Nori/Shankar [15] showed that a 
non-mergeable product trellis is isomorphic to the BCJR-trellis based on the same span list. 
A particularly nice feature of the tail-biting BCJR-trellises is that they naturally give rise to 
a dualization technique, resulting in a trellis for the dual code with the same state complexity 
profile. 

In this paper we will discuss the relation between the Koetter/Vardy and Nori/Shankar 
approach and present further properties. We will show that BCJR-trellises (in a slightly 
stronger sense than in p^5]) are non-mergeable and that all KV-trellises are BCJR-trellises, 
hence non-mergeable. We will further show that any one-to-one product trellis can be merged 
to a BCJR-trellis and merging is accomplished by taking suitable quotients of the vertex 
spaces. Finally, we will discuss a duality conjecture stated in j9]. It deals with a particular 
way of dualizing a KV-trellis. This leads to a trellis with the same state complexity profile, 
but for which it is not a priori clear whether it represents the dual code. We will show that, 
in general, this depends on the choice of the characteristic matrix and we will prove that 
for minimal trellises there always exists a suitable choice of the dual characteristic matrix 
such that the conjecture holds true. In that case the dual trellis obtained via Koetter/Vardy's 
characteristic matrix turns out to be the same as the BCJR-dual. The main tool for the proof 
is a dualization technique going back to Mittelholzer |13j and Forney which amounts to 
dualizing the edge spaces with respect to a particular bilinear form. As the proof will show, 
for minimal trellises the resulting edge space dual coincides with the BCJR-dual. We will 
conclude the paper with a list of open problems. 
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2 Preliminaries 



In this section we introduce the basic notions for tail-biting trehises. Throughout the paper 
let F be a finite field. 

A tail-biting trellis T = {V,E) over F of depth n is a directed edge- labeled graph with the 
property that the vertex set V partitions into n disjoint sets F = Vq U Vi U . . . U Vn-i such 
that every edge in T that starts in Vi ends in T^j+imodn and where the edges are labeled with 
field elements from F. Thus, the edge set E decomposes into E = IJI^o^ where Ei is the 
set of edges starting in Vi and ending in Fj+imodn- A cycle in T is a closed path of length n 
in the trellis. We will always assume that the cycles start and end (at the same point) in Vq. 
If I Vol = 1, the trellis is called conventional (we will not consider conventional trellises where 
I Vol > !)• We call the trellis reduced if every vertex and every edge appears in at least one 
cycle. The trellis is called biproper if any two edges starting at the same vertex or ending at 
the same vertex are labeled distinctly. The interval X := {0, . . . ,n — 1} is the time axis of 
the trellis. 

In the sequel we will always identify the edges with their triples consisting of starting point, 
label, and endpoint point. Thus, the edge sets are given by 

Ei = {{v, a, v) I there exist an edge v-^ v where v £ Vi, v G V^+i, a £ F}, i £ I. 

Hence E'j C Vi x F x V^+i and (V, E) completely specifies the trellis. The edge sets have also 
been called trellis section [2l|6] or local constraints [5]. 

Notice that we have to compute modulo n on the time axis X. If the trellis is conventional, 
however, there is no need in doing so because in that case all paths are cycles and the 
endpoints of edges in En-i do not carry any information. 

Each tail-biting trellis gives rise to its edge-label code consisting of the edge-label sequences 
of all cycles in T. We will need to make this and other related notions precise. 

Definition 2.1 Let T = {V,E) be a tail-biting trellis over F of depth n and put W := 

Fo X F X . . . X Vn-l X F. 

(a) The label code of T is defined as 

S{T) := {{vo,Co,Vi, . . .,Vn-l,Cn-l) G W | 

there exist a cycle vq-^ vi-^ . . . Vn-i^^^ vq in T}. 

The trellis T is called linear if T is reduced and the vertex sets Vi,i £ I, are vector spaces 
over F and S{T) is a subspace of W. In this case the edge spaces Ei,i £ I, are subspaces 
of Vi X F X Vi+i because they are projections of S{T). 

(b) The edge-label code of T is defined as 

C(T) := {(cq, . . . , c„_i) £ F" I there exist a cycle vq-^ vi-^ . . . Vn-i^'^^ vq in T}. 

We say that T represents the code C C F" if C(T) = C. Note that if T is linear then 
C(T) is a linear block code. The trellis T is called one-to-one if there exists a bijection 
between C{T) and the cycles in T. That is, every word in C(T) appears as the edge label 
sequence of exactly one cycle. 

(c) If T is linear the state complexity profile (SCP) is defined as (so, . . . , Sn-i); where Si = 
dim Vi. The edge complexity profile (ECP) is defined as (eo, . . • , Cn-i), where ei = dimii^i. 
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(d) Two trellises T = (V, E) and T' = {V' , E') are called isomorphic if there exists a bijection 
cj) ■ V — > V such that 4){Vi) = for all i £ I and such that {v, a, w) S Ei if and only 
if ((/'(f) a, (j){w)) G ii^j'. If r and T' are linear, then we also require (ply. : Vi — > V- be 
isomorphisms. Isomorphic trellises represent the same code. 

(e) Two trellises T = {V, E) and T' = {y\ E') are called structurally isomorphic if there 
exists a bijection (j) : V — > V such that (ftiVi) = V- for alH € X and such that for all 
{v, w) G Vi X Vi+i the number of edges from v £ Vi to w £ Vj+i equals the number of 
edges from (j)(v) S V- to (j){w) G V^^i (thus we disregard the edge labels). Again, if T 
and T' are linear we also require (jyly. : Vi — > be isomorphisms. 

Throughout the paper we will only deal with linear block codes and linear (hence reduced) 
trellis representations. Observe that, in general, structurally isomorphic trellises do not 
represent the same code. However, we will only be interested in this notion for trellises of a 
fixed code. 

We wish to briefly touch upon all these notions from the viewpoint of behavioral systems 
theory. A trellis T can be regarded as a dynamical system with latent variables in the sense 
of |16l Def. 1.3.4], where the latent variables represent the vertex sequence and satisfies the 
axiom of state at each time i, see \19\ Def. 1.3]. In particular, the vertex space Vi is the 
state space at time i. In this context the edge spaces, consisting of all state-output-next state 
triples, are called the evolution law [19l Def. 1.4]. Completeness as defined in [191 P- 184], a 
crucial notion in behavioral systems theory relating local and global behavior does not play 
a role in our setting since, due to the finite time axis, each system is complete. Notice that 
the system is, in general, highly time-varying since even the state space dimensions depend 
on time. The label code S{T) is — up to the ordering of labels and vertices — the full 
behavioi]^ while the edge-label code C{T) is the manifest behavior, see again [161 Def. 1.3.4]. 
If we consider the code C as the given system, which, as usual in behavioral systems theory, is 
identified with its set of trajectories (the codewords), then T is simply a state-space realization 
of C if C(T) = C. Furthermore, isomorphic trellises are equivalent systems in the sense of jT9l 
p. 205]. They merely differ by the labeling of the latent variable (the vertices). The system 
theoretic notion of minimal realizations now becomes minimal trellises, that is, minimizing 
the amount of data needed for a realization. While for conventional trellises this is well- 
understood and amounts to the unique minimal trellis (up to isomorphism) with a variety 
of universal properties, the situation is much more complicated for tail-biting trellises. This 
leads to various notions of minimality for trellises of which only the following one will be 
relevant for us. For other notions see for instance [141 [T2] or |9l p. 2085]. In this context 
we will also introduce the notion of mergeability of trellises (In system theoretic language 
merging as defined below is also referred to as lumping states, see [HI p. 207]). 

Definition 2.2 Let C C F" be a linear block code and T = {V, E) be a linear trellis of C. 

(a) T is called minimal if there exists no linear trellis T' = [V' , E') of C such that \ Vl\ <\Vi\ 
for all i £ I and \Vj\ < \ Vj\ for some j S I. 

(b) T is called non-mergeable if for any i £ I merging any two vertices vi, V2 £ Vi results in a 
trellis T that does not represent C. Here merging means replacing the two vertices vi, V2 
in the trellis by a single vertex v £Vi and replacing each edge of the form (vj, a, w) £ Ei, 
resp. {w, a, vj) £ -Ej-i, where j = 1,2, by the edge {v, a, w) £ Ei, resp. {w, a, v) £ Ei^i. 

Due to our restriction to linear trellises it seems necessary to also address the issue whether 
^Strictly speaking, the full behavior is the space of all paths in T and not just the cycles. 
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merging leads to a potentially non-linear trellis. It turns out however, that mergeability is 
compatible with the linear structure and hence merging can always be accomplished such 
that the resulting trellis is linear again. 

It is not a priori clear whether minimal (linear) trellises exist for a given linear code. For 
conventional trellises this is indeed the case as is well-known since a long time and leads 
to minimizing many other measures as well; see [HI Thm. 4.26]. Moreover, all minimal 
conventional trellises are isomorphic and even minimal in the larger class of all trellises, 
not necessarily linear. Furthermore, for conventional trellises minimality coincides with non- 
mergeability. A nice way of constructing the unique minimal conventional trellis of a given 
code, using realization theory for dynamical systems, has been given by Forney in [Ij. For 
tail-biting trellises minimality has been studied in much detail by Calderbank/Forney/Vardy 
in [2j and Koetter/Vardy in In particular, it has been shown that linear minimal tail-biting 
trellises exist (the minimal conventional trellis is one of them) and there exist non-isomorphic 
ones. This is actually easy to see by constructing the minimal conventional trellis for each 
cyclic shift of the given code and then shifting back that trellis; see [9l p. 2084]. This results 
in many (non-isomorphic) minimal trellises and in each one at least one vertex space Vi is 
trivial. However, in general there exist other minimal trellises (that is, for which none of 
the vertex spaces is trivial); see for instance [9l Exa. 6.1] for the (8,4,4) Hamming code, 
which also appeared in [;2j p. 1449]. Moreover, different from the situation for conventional 
trellises, minimality is stronger than non-mergeability for tail-biting trellises; see for instance 
[9l Exa. 3.3]. The main result of f9] is a construction from which all minimal tail-biting 
trellises of a given code can be derived. Our paper is mainly devoted to investigating this 
construction, which we will therefore introduce in the next section. 

Finally, let us fix the following notation. For a matrix M £ ]p™-x'^ -we denote its row span as 
imM, thus imM := {aM \ a G F™}. If not otherwise stated, C C F" denotes a /c-dimensional 
code of length n over the finite field F with the following generator and parity check matrices: 

C = imG, where G={gi,)= : I = (Go ... G„_i) G F^^", (2.1) 



and 



/ Ho 



jpnx(n-fe)^ ^2.2) 



C = keiH^ := {c G F" | cH^ = 0}, where = 

\Hn-l, 

Hence d G F'^ and Hi G F"-'^ are the columns of G and rows of , respectively, while 
9ii • • • iS'fc G F" are the rows of G. For simplicity we will also assume that the support of C 
is X = {0, . . . , n — 1}, that is, no column of G is zero. Finally, it will be convenient to have a 
notion for the indicator function of a subset ^ C X. Thus, we define /-^ G F", where = 1 
if j G ^ and if- = else. 



3 Product Trellises and KV- Trellises 



We begin with recalling the construction of tail-biting trellises based on products of ele- 
mentary trellises. The underlying notions and constructions are well-known, see jlOj for the 
conventional case and |2t |9j for the tail-biting case. Thereafter, we will introduce the notion 
of a characteristic pair for a given code, from which in essence all minimal tail-biting trellises 
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can be derived. This crucial and very fruitful concept goes back to Koetter/Vardy [H], who in 
turn based it on previous work by Kschischang/Sorokine [TO]. The relation between minimal 
trellises and characteristic matrices will be discussed in detail. 

Due to the cyclic structure of the time axis X for tail-biting trellises, the following interval 
notation has proven to be very convenient. For a, 6 G X we define 



[a, b] 



{a, a + 1, . . . ,b} if a <b, 

{a, a + 1, . . . , n — 1, 0, 1, . . . , 6} if a > 6, 

a, b] = [a,b]\{a} for all a, b. 



(3.1) 



We call the intervals (a, b] and [a, b] linear if a < 6 and circular else. Notice that (a, a] = 0. 
It is easy to see that 

I\ (a, b] = {b, a] for aU a / 6, (3.2) 

hence the complement of a nonempty linear interval is circular and vice versa. Observe also 
that (a, 6] <;=^ (a, b] is linear. 

Definition 3.1 For a nonzero vector c = (cq, . . . , Cn-i) G F" we call the interval (a, b] a 
span of c if Ca / / Cb and Cj = for j [a, b]. In this case we will also call [a, b] the closed 
span of c. 

For instance, the vector v = (0,1,3,0,2,2) G has the spans (1,5], (2,1], (4,2], and (5,4]. 
Notice that every nonzero vector has a unique linear span. It turns out that not including 
the starting point a in the span of x is very convenient for later notation; see, for instance, 
the next definition. 

Definition 3.2 Let c = (cq, . . . , c„_i) G F" and (a, b] be a span of c. The elementary trellis 
for the pair (c, (a,6]) is defined as Tc^[a,b] '■= (^)-^)) where the vertex sets and edge sets 
V = U'lZoVi and E = Ul'^Ei are as fohows: 
if a = 6 then, 

Vi := {0} for ah i G X and Ei := 

ii a ^ b, then 



{(0,0,0)}, ifi/a 
{(0, acj, 0) I a G F}, if i = a; 



0, if i (a, b] 
Vi:= < and Ei r- 

IP , if i G (a, o] 



{(0,0,0)}, ifi0[a, 6] 

{(0, aci, a) I a G F}, if i = a 

{(a, aci, 0) I a G F}, if i = 6 

{{a, aci, a) \ a £ F}, if i G (a, 6 — 1] 



Using the indicator function /("'^l, the above vertex and edge spaces can be written as = 
im (4'^'']) C F and Ei = im (4"''l, c„ 4+'?) C x F x Vi+i. 

Note that T^^(^a,b] is ^ one-to-one linear trellis and C{T) = imc = {ac \ a G F}, the one- 
dimensional code generated by c. 

With the aid of the product construction as introduced in |10^ |9] we can derive trellises for 
higher-dimensional codes. Recall that the product T := T x T' of two trellises T = (y,E) 
and T' = {V, E') of depth n over F is defined as the trelhs f = {V , E), where Vi := V x V- 
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and Ei = [ ((?;, v'),a + a', {w, w')) | {v, a, w) G Ei, {v', a' , w') S E'j'} for all i G X. The product 
trellis f satisfies C(f ) = C(r) + C(r'). 

The most important application of this construction is the product of elementary trellises 
leading to tail-biting trellis representations of linear block codes. Recall the data for the 



given code C C F"' as given in (2.1 ). 
Theorem 3.3 ( [TOl El [9] ) Let 

S ■.= [{ai,bi],l = l,...,k] (3.3) 

be a span list for G, that is, {ai, bi] is a span (linear or circular) for the row gi, I = 1, . . . ,k. 
Then product trellis Tq^s '■= ^gi,(ai,6i] x ... x ^'^^,(0^,6^] represents the code C, that is, 
C(Tq^s) = C. Moreover, Tq^s linear (in particular reduced) and one-to-one. The ver- 
tex spaces of Tq^s are given by 



Vi = im Mi , where Mi 



,k 



G F^^^ and //' - <! ^' ^ 4) 
G^ and/i,-<; ifi0(a;,6;] ' ^"^'^^ 



while the edge spaces are given by Ei = im (Mj, Gi, Mj+i) for i G X. 

Observe that /i^ = ig the indicator function of {ai,bi]. It is straightforward to see that, 

more generally, a product trellis T := (^1,61] x . . . x T^^,(afe,6fe] is one-to-one if and only if 
gi, ■ ■ ■ ,gk are linearly independent]^ 

In the sequel we will only consider one-to-one product trellises (with the exception of Re- 



mark 4.15). In addition to this restriction our product trellises are confined even further 



when compared to the class considered by Koetter/Vardy [9]. Due to Definition 3.1 we only 
consider "shortest spans" for the generators, that is, the vector is nonzero at the endpoints 
of the span (in other words, the closed span is a smallest linear or circular interval containing 
the support). This restriction is justified by the fact that an elementary trellis based on a 
non-shortest span is mergeable and thus so is the product trellis, see O Lemma 4.3]. Of 
course, even with shortest spans a product trellis may be mergeable; we will see examples 
later on. 

The following shift property of product trellises is a direct consequence. Recall that we 
compute with indices modulo n. This, of course, also applies to the span lists. 

Remark 3.4 Let a : F" — > F", (cq, . . . , c„_i) 1 — > (ci, . . . , c„_i, cq) be the left shift on F" 
and let G* G F'^^" be the matrix consisting of the shifted rows cr{gi), I = 1, . . . ,k. If 5 as 
in (3.3) is a span list for G then S* = [(o; — 1, 6; — 1], / = 1, . . . , fc] forms a span list for G*. 



Furthermore, with the notation as in (3.4) the vertex spaces of the product trellis Tg*,s* are 
given by V* = imMj_|_i and the edge spaces are given by E* = im (Mj+i, Gj+i, Mj_|_2) for 
i gX. 



For the product trellises Tq^s it is not hard to give formulas for the SCP and the ECP in 
terms of the span list C. Let k := {1, . . . , k}. 

^There is one extreme case to exclude from this statement; if some spans are of the form (a;, ai] then T is 
one-to-one if and only if the set Q is linearly independent, where G is obtained by taking for each span of the 
form {ai,ai] only one generator and taking all generators with non-empty span. Since in our considerations 
gi, . . . ,gk are always linearly independent this case will not be relevant for us. 
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Proposition 3.5 Let S and Tg,s be as in Theorem 3.3 Put A := {ai, . . . ,afc} and B = 
{61, . . . , 6/c}. Moreover, for each i £ I define 

Ci:={l£k\i€iai,bi]}. (3.5) 

Then the SCP of Tq^s is given by (sq, • ■ ■ , Sn-i), where Si = \Ci\. The ECP (eo, • • • , e„_i) 
satisfies the following. 

(a) If ai, . . . ,aj. are distinct, then Ci = Si + if' for i G X. 
(h) If bi, . . . ,bk are distinct, then ei = Sj+i + if for i G X. 

(c) Let ai, . . . ,ak be distinct and bi,...,bk be distinct. Then the SCP satisEes 

{Si, ifieAnBori^AuB, 

Si + l, ifieA\B, 

Si -I, ifiGB\A, 

and the trellis Tg,s Is biproper. 

Observe that if all spans (a;, 5/] are linear, then the condition ai, . . . , and bi, . . . ,bk both 
being distinct is equivalent to G being an MSGM in the sense of [12^ Def. 6.2, Thm. 6.11] 
(also called "sets of atomic generators" in pD] or "shortest basis" in [3]). Moreover, in this 
case Tg,s is the unique minimal conventional trellis of C and the formulas given in (a) and (b) 
coincide with those known for conventional trellises; see, for instance, |12l p. 1080]. However, 
it is well-known that if not all spans are linear the (tail-biting) product trellis is in general 
not minimal; see, for instance. Example |4. 10 later on. 



Proof: The first statement about the SCP is immediate from the definition of the vertex 



spaces Vi = imMj in (3.4). For the ECP, recall that by definition of the edge spaces we have 
ei = Yk{Mi,Gi,Mi+i). 

(a) We proceed as follows. For a matrix M € F'^^'' denote by col(M) C its column space 
and by col(M, t) its t-th column. Fix i G I. Let := rk {Mi, Gi). In a first step we will show 
that Ci = Si + if and in a second step we will prove that Cj = e^. For the first step, notice 
that Ci = Si if Gi G col(Mj) and Sj + 1 else. Moreover, the definition of the matrices Mj 
implies 

Gi G col(Mi) gii = for aU I ^ d. 

Since {ai,bi] is a span for the row gi one has gu = for all i [o;, bi] and gi^ai / 0. But 
then the above along with [ai, bi] U {a;} = [a/, 6;] shows that Gi G col(Mj) <^=^ i ^ A. All 
this proves ej = Sj + if. In order to establish Cj = it remains to show that col(Mj_|_i) C 



col(Mj,Gi). From ( |3^ we know that col(Mi+i,/) = if / A+i and col(Mi+i,/) = ei 
if / G where £/ G F'^ denotes the k-th. standard basis vector. Hence fix / G If 

I G Ci, then col(Mj+i,/) = ei = col{Mi,l) G col(Mj,Gj) and we are done. If / Ci, then 
i ^ iai,bi] and thus col(Mi,^) = 0. But then the assumption i + 1 £ («/,&;] implies that 
i = ai and thus gu ^ 0. In order to examine the other entries of the column vector Gi 
let r G k\{l}- If r G A then col(Mi,r) = Sr- If r ^ Ci then i {ar,br] and since 
i = ai, which is distinct from a^, this implies i [a,.,?),.], hence gri = 0. All this shows that 
Gi-Ylr=i 9riCol{Mi, r) = gii£i = giiCo\{Mi+i,l). As a consequence, col(Mi+i, /) G col(Afj, Gi) 
and therefore = rk {Mi, Gi, Mj+i) = rk {Mi, Gi) = Si + if. 
(b) We proceed similarly using the rank ej := rk {Gi, Mj+i). This time we have 

Gi G col(Mj_|-i) <;=^ gu = for all / such that i + 1 {ai, bi] 

*^==^ gu = for all I such that i ^ {ai — 1, 6^ — 1]. (3.6) 
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The latter semi-open interval is [a/, bi — 1] if ai ^ hi and empty otherwise. Since gi^hi 7^ (3.6) 
implies the equivalence Gi E col(Mj_|_i) <;=^ i ^ B. Hence = Sj+i + if and it remains 
to show that col(Mj) C col(Gi, Mj+i). Fix / G d. Then col(Mj,/) = e^. If I £ d+i then 
col(Mj+i,/) = £[ and we are done. If I d+i, then i + 1 {ai,bi], but i G thus 
i = hi. In this case col(Mj+i, /) = and gn ^ 0. For the other entries of Gi pick r £ A; \{/}. 
If r E A+i then col(Mj+i, r) = Sr- li r ^ d+i, then i + 1 {ar, br], hence i (a,. — 1, 6,. — 1]. 
But since i = bi is different from br this implies i [a,-, ^r]- Hence gri = 0. All this shows that 
Gi - J2r=i 9riCol{Mi+i, r) = giiSi = giiCol{Mi,l) and therefore col(Mi,/) G col(Gi, Mj+i). In 
particular, rk (M^, Gi, M^+i) = rk(Gi,Mi+i). 

(c) The identities for Si follow from equating the expressions for a in (a) and (b). Further- 
more, in (a) and (b) we saw that rk(Mj,Gj) = rk(Gj,Mj+i) = rk (Mj, Gj, Mj_|_i). This in 
turn implies ker(Mj,Gj) = ker(Gi, Mj+i) = ker(Mj, Gj, Mj+i), from which the biproperness 
follows. □ 

At this point we wish to show a path property of product trellises that will prove useful later 
on when discussing mergeability. The property holds true under a certain condition on the 
vertex spaces, satisfied by product trellises Tg^Si but also by the BCJR-trellises studied in 
the next section. In system theoretic language this path property means that the trellis is 
point controllable (in n steps) in the sense of [IS, p. 188]. 



Lemma 3.6 Let C = imG and let the span list S be as in (3.3). Let T = {V,E) be a trellis 
for C such that the vertex and edge spaces are given byVi = im Mj and Ei = im (Mj, Gj, Mj+i) 
for certain matrices Mi G jp'^x''. j;,et the sets d be defined as in (3.5) and suppose that for 
all i € I the l-th row of Mi is zero whenever I ^ d- 

Then for every v G Vq = im Mq there exists a path through the trellis T that starts at v ^Vq 
and ends at G Vq. Precisely, there exist vectors a^^\ . . . , a^"""*^) G F'^ such that 



a 



(°)Mo = t;, a(^) -a(^+^) G ker Mi+i fori = 0,...,n- 2, a^^-^^Mo = 0. 



Proof: In the sequel, for any vector a G F'^ let ai denote the Z-th coordinate of a. By 
assumption 

{a G F'' I a; = for / G d} ^ ker Mi for all i G I. (3.7) 

Let a^^^ G F'^ such that a^^^Mo = v. By assumption on the rows of Mq we may assume 
without loss of generality 

a[°^ = for / 0£o- (3.8) 

Define a(^) G F^ via 



(0) 

' ^ (3.9) 
0, if/0£in£o. 



(1) _ I a)''', if / G £i n£o 



Then and K9h show that - af^ = for Z G £i and thus a^^) - a(°) G ker Mi. 



Assume now we proceed in this way and have constructed a^^^ G F'^ such that 

' \ 0, if^^nioA 
and such that a^^^ — a^^~^^ G kerMj. Then put 

^/ ^^""^^ (3.11) 



0, if/^ns^ 
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a 



(i) 



for all / G flS by ([3lT|) and 



O'+i) 



a 



(i) 



for all / G 



Then (a(^+i) 

£.j+i\f]{^QCi by ( |3l0l ) and plTj ). Hence (a(j+^) - a^^)); = for all I £ Cj+i and thus 
Q,(i+i) _ Q,(j) g kerMj+1. Thus, proceeding with this construction we find a^""^) G F'^ such 
that a("-i) - q("-2) £ kerMn_i and 



^(n-i) = for / Pi A 



n-l 



i=0 



But then a*^" ""^^ = because PlILo^ = {/ G A: | i G (a^, 6/] for all i G X} = 0, where the 



latter is due to the fact that (a, b] 7^ X by definition of half-open intervals in (3.1). Hence 
a("-i)M„ = as desired. 

Observe that the proof actually shows that there is even a path from u G Vq to G Vn-i 
because a^"-'^^ =0. □ 

A main result of [9] is the construction of a certain characteristic matrix for a given code from 
which all minimal trellises can be derived. We will define the characteristic matrix slightly 
differently, namely based on certain properties rather than on the outcome of a particular 
procedure. This will facilitate later considerations. The existence of this object will then 
follow from the algorithm derived in [9]. However, some care has to be exercised due to 
the fact that our characteristic matrix will not be uniquely determined by the code (see 



Example 3.12 below) 



Definition 3.7 Let C C F" be a A;-dimensional code with support I. A characteristic pair 
of C is defined to be a pair {X, T), where 



X 



Xi 



G F"^" and r = [iai,bi], I = 1, . . . 



n 



(3.12) 



\xr. 



have the following properties 

(i) imX = C, that is, {xi, . . . , Xn} forms a generating set of C. 

(ii) (oi, bi] is a span of a;; for / = 1, ... , n. 

(iii) ai, . . . , a„ are distinct and 61, . . . , 6„ are distinct. 

(iv) For all j G X there exist exactly n — k row indices li, . . . , In-k such that j G (a^. , 6;.] for 
i = 1, . . . , n — fc. 

We call X a characteristic matrix of C and T the characteristic span Ust. The span matrix 
of the characteristic pair {X, T) is defined to be the matrix 



S = (sij)i=i,....n G Nq^", where sij 

j—O,.. . ,n — 1 



1, if j£{ai,bi], 
0, if j^{ai,bi]. 



As we will see below, the characteristic span list is, up to ordering, uniquely determined by 
the code C. This is one of the results proven in [9]. The only difference between our definition 
and the one of Koetter/Vardy is that in [9] for each span (a;, bi] the corresponding row in X 
is the unique lexicographically first codeword having this span (with a slight modification in 
the non-binary case). As Koetter/Vardy point out in ^ Remark, p. 2095] there is no need 
for this particular choice. 

Here are some first simple properties of characteristic pairs. 
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Remark 3.8 (a) Every column of the span matrix S has exactly n — k entries equal to 1 
and k entries equal to 0. This is simply a reformulation of Definition |3.7|(iv). 



(b) Let a be the left shift on F" as in Remark 3.4, If (X, T) as in (3.12) is a characteristic 



pair for C, then (X*,T*) is a characteristic pair of cr(C), where 



X* :-- 



/ct(xi)\ 



and 7^:= [(a, - 1, k - 1], I = 1, . . . , 



n . 



(c) 



By part (iv) of the definition there exist exactly k linear spans in the span list T (recall 
that (a, b] is linear iff (a, b]). Therefore and due to (iii), the corresponding rows of X 
form an MSGM of C in the sense of [12^ Def. 6.2, Thm. 6.11]. Likewise, the previous item 
shows that for each j = 0, . . . , n — 1 a characteristic matrix contains a shifted MSGM for 
the shifted code cr^{C). In this sense, the rows of a characteristic matrix may be called a 
set of "shortest generators" of C (and all its cyclic shifts) in the sense of [3]. 



So far we have not established the existence of characteristic pairs for a given code. This 
crucial result, along with the uniqueness of the span set, has been proven in [9] and an 
algorithm has been given for computing a characteristic matrix. All this leads to the following 
result. 

Theorem 3.9 Let C C be a k-dimensional code with support I. Then C has a character- 
istic pair and the characteristic span hst is, up to ordering, uniquely determined by C. Thus, 
the rows of the span matrix S in DeGnition 3.7 are uniquely determined up to ordering. 



Proof: The existence of a characteristic pair with the properties as in Definition |3.7[ i) - 
(iv) has been shown in [9], Lem. 5.7, Thm. 5.9, Thm. 5.10 and its proof]. Since our notion of 
characteristic pair is slightly more general than the one in [9] , we need to pay special attention 



to the uniqueness of the characteristic span list. In order to do so let {X,T) as in (3.12) 
be a characteristic pair of C. Fix j G {0, . . . , n — 1} and choose the k indices h, . . . ,lk for 
which j (oiij^zj; they uniquely exist due to property (iv). Then (a;. — j,b^ — j] and 
the latter is a span for the shifted row a^(xi.). Hence those shifted rows have linear spans 
and, due to property (iii) they form an MSGM of cr^{C) in the sense of |12[ Def. 6.2]. By \12\ 
Thm. 6.11] the shifted span list is uniquely determined by the code cr^{C) and hence by C. 
All this shows that the span list 7" is uniquely determined by C and coincides with the one 
of the characteristic matrix as defined in [9l Def. 5.2]. □ 

The matrix X of a characteristic pair is, in general, not uniquely determined by C (up to 



ordering of the rows), see Example 3.12 below. 

The importance of the characteristic pairs is that all minimal trellises of the code can be 



retrieved from them, 
following notion. 



This is one of the main results in 



Let us first introduce the 



Definition 3.10 Let C C F" be a /c-dimensional code with support I and let (X, T) be a 
characteristic pair of C as in (3.12). Any trellis of the form T^;^,(a;^,b;j x . . . x T^, ,(a; ,6; ], 
where x^j, . . . are linearly independent rows of X, is called a KV^x,T)'^^^^^^^ ^- Every 
trellis that is a KV(x,T) "trellis fo'^ some characteristic pair (X,T) of C is called a KV-trellis 
of C. 
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Remark 3.11 By Theorem 3.3 every KV-trellis is one-to-one. Moreover, if T = T^. 



is a KV(x,T)"t^6llis, then the SCP of T is given by {vi, . . . ,Vn)S, where S 
is the span matrix of the characteristic pair {X, T) and Vi = 1 ii i £ {li, . . . , Ik} and Vi = 
else. This follows immediately from Theorem 3.3 As a consequence, the SCP (sq, . • • , Sn-i) 
of a KV-trellis satisfies Sj < mm{k, n — k} for all i £ I, which generalizes the well-known 
Wolf bound [20] (see also |18| Thm. 5.5]) to tail-biting trellises. Finally, since a KV-trellis is 
a product trellis where the starting points of the spans are distinct and so are the end points. 



the formulas for the ECP and SCP given in Proposition 3.5 apply. 



Unfortunately, the set of KV(x,T)"t^6llises depends on the choice of the characteristic pair 
{X,T)- Indeed, we have the following example. 



Example 3.12 Let F = Fa and C 

{X,T) and {X',T), where 



X 



/I 


1 





0^ 




A 


1 





0\ 





1 


1 


1 


,x' = 





1 


1 


1 


1 





1 


1 


1 





1 


1 


Vo 


1 


1 


V 




^1 





1 


1/ 



{(0000), (1100), (0111), (1011)} C F^. The two pairs 



T= [(0,1], (1,3], (2,0], (3,2]], (3.13) 



are both characteristic pairs of C. Their common span matrix is 



S 



/O 1 0\ 

11 

10 1 

Vi 1 1 0/ 



The last two rows of X are linearly independent and lead to a KV(x,T)"t^^llis with SCP 
(2, 1, 1, 1); see Remark 



3.11 



But since the last two rows of X' are linearly dependent, this 
SCP does not appear for any KV(x',T)"ti"ellis of C as one can see directly from the span 
matrix S. It is easy to see that the two matrices X and X' are the only two characteristic 
matrices for the code C, up to ordering of the rows. Indeed, examining the set of codewords 
it is clear that for the spans (0,1], (1,3], and (2,0] there is exactly one codeword having 
this span, while the span (3,2] is attained by the two codewords (0111) and (1011). This 



leads exactly to the two choices of characteristic matrices given in (3.13). Since (0111) is the 
lexicographically first of those two vectors, the matrix X is the characteristic matrix singled 
out in [S]. 

The following result expresses the fact that the list of spans uniquely determines the structure 
of KV-trellises. For structural isomorphisms recall Definition |2.1[e). 



Corollary 3.13 Let T = Tq^s and T' = Tqi gi he KV-trellises of C. Then the following are 
equivalent: 

(i) T and T' are structurally isomorphic, 
(a) T and T' have the same SCP and ECP. 
(Hi) S = S' , up to ordering. 

Proof: (i) (ii) is obvious. 

(ii) =^> (iii) From Proposition 3.5 a) it follows that the sets A and A' of starting points of the 



spans in S and S' , respectively, are identical. But for KV-trellises any starting point uniquely 
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determines its span, see Theorem 3.9 Hence S = S' , up to ordering. 



(iii) ^ (i) Without loss of generality, let us assume S = S' . Then Mj = for the product 



trellis matrices of T and T' , respectively, defined in (3.4). But then the edge spaces of T 



and T' are given by Ei = im (Mj, Gj, Mj+i) and E'- = im (Mj, G^, Mj+i), where the i-th 



column of G and G' is denoted by Gi and G[, respectively. By Proposition 3.5 we have 
dimEi = dimE-. This shows that 4>i{a{Mi,Gi, Mi+i)) = a{Mi,G[,Mi+ i) is a well-defined 
structural isomorphism between T and T' . □ 

It is worth observing that the implication (ii) =^ (iii) is not true in general for product trellises. 
For instance, the two matrices 



'1 1 1 0\ /O 1 V 

1001 1,G' = 001 11 
,1110 1/ \1 1 1 1. 



generate the same code of dimension 3. Choosing the span list S = [(3, 1], (0, 4], (4, 2]] for G 
and S' = [(4, 1], (3, 2], (0, 4]] for G' results in product trellises Tg,s and Tg\s' with the same 
SCP (2,3,2,1,2) and ECP (3,3,2,2,3) even though S ^ S' . However, one can check that 
the two trellises are not structurally isomorphic. As a consequence, the question whether (i) 
above is equivalent to (iii) remains open for (one-to-one) product trellises. 

Let us now state the main result of [9]. 

Theorem 3.14 ([ST, Thm. 5.5]) Let C C F"' be a k-dimensional code with support I and 
let T he a minimal trellis of C in the sense of Definition \2.^ a). Then T is one-to-one and 
there exists a characteristic pair {X,T) such that T is a Ky(x,r)"t^6]iis. 

Proof: Since our result is phrased differently from what is in ^ Thm. 5.5] we think it 
is worthwhile to give a proof. First of all, T is a product trellis by [9l Thm. 4.2] and is 



one-to-one due to Lemma 5.2]. Hence T is a product trellis as in our Theorem 3.3 



because, due to minimality, the trellis is non-mergeable and thus we may assume shortest 



spans for all generators; see also the paragraphs following Theorem 3.3 Thus, T = T^j^^(^ai,bi] ^ 



X T^^^(aj^,bj^] for certain xi G C with spans {ai,bi]. Now [91, Lemma 5.4] implies that 
(xi, (oi, 6i]), . . . , (xfc, (ofc, 6fc]) are characteristic generators in the sense of [9, Def 5.2]. But 
this simply means that this list can be extended to a characteristic pair {X, T) of C in the 



sense of our Definition 3.7 and thus T is a KV(x,T)"trellis. □ 



One should bear in mind that not all KV-trellises are minimal. For instance, the last two rows 



of X in Example 3.12 are linearly independent and give rise to a trellis with SCP (2, 1, 1, 1), 
but the first two rows of X are also linearly independent and lead to a trellis with SCP 
(0, 1, 1, 1). This shows that the first trellis is not minimal in the sense of Definition |2.2| [^a). A 
larger example is given in [9, Exa. 6.1]. 

It is important to notice that the above result does not imply that all minimal trellises of C 
arise from a fixed characteristic pair. Indeed, we have the following example. 

Example 3.15 Let F = F2 and C := imG, where 

/O 1 1\ 
G = 1 1 1 G F^^^ 
\1 1 1 0/ 
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Put 



X 



/I 1 0\ 

1 1 

1110 

10 10 

1 1 

Vi 1 1 0/ 



X' 



/I 1 0\ 

1 1 

1110 

10 10 

1 1 

Vi 1 1 1 1/ 



and T = [(0,3], (1,5], (2, 4], (3,0], (5,1], (4, 2]]. It is easy to verify that {X,T) and {X' ,T) 
are both characteristic pairs of C. Put S = [(1, 5], (2, 4], (4, 2]] . Then the product treUises 
T = Tg.s and T' = Tq/^^, where 

/O 1 1^ 
G' = 1 1 1 
\1 1 1 1 1^ 

are both KV-trelhses based on the same choice of characteristic spans; the first one is a 
KV(x,T)"t^6llis and the second one is the corresponding KV(x',T)"t^6llis- The KV-trelhses 
are minimal as one can check by going through the SCP's of all possible KV(x,T)-trellises. 
(There are 10 choices for selecting 3 linearly independent rows in X and none of them leads 
to a smaller trellis.) As one can verify directly from the graphs below, the trellises T and T' 
are structurally isomorphic, but not isomorphic. 



110 110 




110 110 



001 001 
0. 4- 

000 000 000 000 000 000 000 

[Tg,s] 




001 001 

» V- 

000 000 000 000 000 000 000 



In light of the previous example it should be observed that, at this point, the following 
question is open. Given a code C C F*^ and fix a characteristic pair (X, T). Is then each 
minimal trellis of C structurally isomorphic to a KV(x,T)"trellis? From Corollary 



3.13 



we 



know that the SCP and ECP of the given trellis uniquely specify the row selection of X that 
may result in a potentially structurally isomorphic KV(x 7-) -trellis T' . But it is not clear 
whether this row selection is linearly independent. Only then the trellis T' represents C and 
thus is a KV(x,T)"t'^6llis. 



Remark 3.16 As shown in [9l Thm. 5.6], the same result as in Theorem 3.14 applies to 
many other notions of minimality for tail-biting trellises, see [9l p. 2085]. 



4 The Tail-Biting BCJR- Construction 

In this section we will present a different way of producing tail-biting trellises for a given code. 
These BCJR-constructions were introduced in [15]. We will show that the resulting trellises 
are non-mergeable and that product trellises can always be merged into a corresponding 
BCJR-trellis by taking suitable quotients. We will see that KV-trellises are isomorphic to 
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their BCJR-counterpart, and, consequently, non-mergeable. Throughout let C be as in (2.1) 
and K^. 



Theorem 4.1 ( |15|, Lemma 2]) Let D G f'^^*^" be any matrix and for i G X deGne the 
matrices 

_ ( D, ifi = 

Then Nn = Nq and the treUis T := T(^g,h,d) with vertex spaces Vi := im A'^j C ip"-*^ st.nd edge 
spaces Ei = im {Ni, Gi, Ni^i) is a hnear, reduced and biproper trelhs representing the code C, 
that is, C{T) = C. We call D the displacement matrix for the trellis T. 

The fact Nn = Nq simply follows from the identity Ni = Nq + Z]j=o ^i^i^ ~ 1; • ■ • 
along with = GH^ = X]j=o ^i^i- "^^^ reducedness and biproperness are a consequence 
of the very definition of the vertex and edge spaces along with the recursive definition of the 
matrices Ni . The displacement matrix D can be interpreted as the (circular) past at time zero 
of each cycle in Ti^g,h,d)- If -D = 0, each cycle starts with zero history and the trellis T(^g,h,d) 
is conventional. In fact, it is simply the classical BCJR-trellis and thus a minimal trellis, 



see |12[ Sec. IV] and [T]. This case will also show up as a special version of Definition 4.5 
below, in which we define what will be called a tail-biting BCJR-trellis. One should regard 
the displacement matrix D as a design parameter for the construction of good trellises. In 



Definition 4.5 we will restrict ourselves to particularly useful choices of this parameter, which 



then will always lead to non-mergeable trellises. 

Remark 4.2 For every U G GLk{¥) the trellises T(^g,h,d) ^-^d T(jjg^h,ud) ^-^'s identical and 
for all V G GL„_fc(F) the trellises Ti^g,h,d) T(^g,vh,dv''') isomorphic and the iso- 
morphisms on the vertex spaces are given by the restrictions of the linear map to those 
spaces. 



The statement C(T) = C of Theorem 4.1 also follows from the following result which will be 
helpful later on. 



Proposition 4.3 Let T = T(^g.h,d) be as in Theorem 4.1 Furthermore, let a^*) G be 
such that the sequence of edges (a^TVi, a»Gi, a«iV,+i), i G X, is a path through T, that 
is, A'j+i = q;(*+-^) A^j+i for i = 0, . . . ,n — 2. Then the edge-label sequence of the path is a 
codeword if and only if the path is a cycle. In other words, 

(a(°)Go, . . . , a("-^)G„_i) G C ^ a("-^)iVo = a^^^iVo- 

Proof: From the assumptions as well as the definition of the matrices Ni we obtain a^^'^Ni = 
a(o)iVi = a(°)iVo + u^^^GqEq and inductively a^'+^^Ni+i = a^^^iVo + T!j=Qa^^^GjHj, for 
i = 0, . . . , n - 2. Hence a("-i)iV„_i = u^^'^Nq + ^""^ a^^^GjHj and this implies 

n-1 
j=0 

This leads to o^^-^^^Nq = o^^^Nq <^ ^21=0 ct^'^GiHi = and the latter is the case if and 
only if (a(°)Go, . . . , a^'^-^^Gn-i) G ker = C, as desired. □ 

One of the nice properties of the trellis construction in Theorem |4.1| is that it respects duality. 
Indeed, the following is straightforward; see also \15\ Lemma 12]. 
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Proposition 4.4 Let T = T(^g,h,d) be as in Theorem 4.1 



Then T := T, 



is a treUis 



representing the dual code C^. Moreover, the vertex spaces of T are given by iuiNj^ , i ^ I, 
and thus the treUises T and T have the same SCP, given by (so, • • • , Sn-i), where Si = rkNi. 
As a consequence, ifT is a minimal trellis for C, then T is a minimal trellis for C-*-. 



Later on in the proof of Theorem 5.3 we will see that in the special case where T(^g,h,d) is 
a minimal KV-trellis, this trellis dualization coincides with the construction given by Forney 
in [5l Sec. VII-D and Thm. 8.4] based on dualizing the edge spaces (local constraints) with 
respect to a particular inner product. 

A particular instance of the trellis T(^g,h,d) is obtained by choosing the displacement matrix 
based on given spans of the generator matrix G. This will relate this construction to the 
product construction of the previous section. 

Definition 4.5 Let S := [(a;, 6;], / = 1, . . . , A;] be a span list of G. Then the trellis T(^g,h,s) 
defined as T(^g,h,d)i where 



D 



(di 



\dkj 



n-l 



G F'^^^"-*^) with di=^ gijHj, 



(4.2) 



is called a (tail-biting) BCJR-trellis of C. 



It should be noted that we define a BCJR-trellis more restrictive than Nori/Shankar in |15j : 
while we use this terminology only for trellises T(^g,h,d)^ where the displacement matrix 
is based on a given span list as in (4.2), Nori/Shankar use the term BCJR-trellis for all 
displacement matrices. 



4.2 



it follows that Ti^g,h,s) is isomorphic to T(^g,vh,s) ^^r all 



Remark 4.6 From Remark 
V G GL„_fc(F). Indeed, if D is as in (4.2) then DV^ is the according displacement matrix 



for Ti 



(G,VH,S)- 



As a consequence, restricted to all vertex spaces furnishes an isomorphism 



between the two trellises. However, if G and UG ^ ¥ , U G GLfc(F), are two generator 
matrices with the same span list S, then the BCJR-trellises T(^g,h,s) ^-iid T^UG,H,S) need not be 
isomorphic. This can be seen from Example 3.15, Indeed, the two (non-isomorphic) trellises 



given in that example are BCJR-trellises, which will follow from Theorem 4.12 presented 
below. 



Notice that if all spans are linear, then D is the zero matrix (see also Proposition 4.8 below) 



and the resulting trellis is the conventional BCJR-trellis. It is well-known that the conven- 
tional BCJR-trellis is minimal, see \12\ Sec. IV], and thus non-mergeable. Later on we will 
see that in the tail-biting case, BCJR-trellises are non-mergeable as well, but they are not 
necessarily minimal. 



Just like the product trellises in Remark |3.4| the BCJR-construction is compatible with the 
shift. 



Remark 4.7 Let S and D be as in Definition |4.5 
matrix consisting of the shifted rows <j{gi), I 
S* := [{ai — 1, bi — 1], I = 1, . . . , kj is a span list for the rows of G*. Let Ni, i G X, be the 



As in Remark \3M let G* G F^^" be the 
1 , . . . , fc and define H* accordingly. Then 



vertex matrices as in (4.1) for the BCJR-trellis T(^g,h,s)- Then the vertex matrices for the 
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BCJR-trellis T(^g*,h*,s*) are given by A^* = iVj+i for i e I. Indeed, by the l-th row of 



A'^Q is defined as 



n—l n— 1 n—1 

3=0.1-1 j=ai-l j=ai 



and the latter is the l-th. row of A'^i. Now the rest follows inductively due to (4.1). 

In the sequel we will show that if the pair {G,S) originates from a characteristic pair of C, 
then the trellis T(^g,h,s) is isomorphic to the corresponding KV-trellis Tq^s- 

We begin by showing the following. 



Proposition 4.8 Let T(^g,h,s) be as in Definition 4.5 and let Ni, i EX, be deGned as in 
Theoreni\4.1l Then 



(a) If (a/, bi] is a linear span, then di = 0. 

(h) Ifi€zl and I € {1, . . . ,k} are such that i ^ {ai, bi], then the l-th row of Ni is zero. 

Proof: (a) If (a;, bi] is linear then all nonzero entries of gi are in the interval [ai,n — 1] and 
therefore di = gijUj = I]j=o 9ijHj = due to Gif = 0. 

(b) Recall that Nq = D as well as the fact that {ai,bi] is a linear span iff ^ (a;, bi]. 
Therefore, the case i = has been proven in (a). Thus, let i > 1. Then, by definition, 
N^ = D + ZT=lGjHr 

Suppose first that the span (a;, bi] is circular, hence in particular, non-empty. Then i ^ (ai, bi] 
is equivalent to i G {bi,ai], which is linear; see also (3.2). Thus, the l-th row of Ni is 

i— 1 n—l i— 1 n—l 

di + Yl SijHj = Yj 9ijHj + gijHj = Y 9ijHj = 0, 

j=0 j=ai j=0 j=0 

where the second identity holds true because gij = for j E [i, a/ — 1]. 

Let now the span (a/, bi] be linear. In this case = by part (a) and the l-th row of Ni is 
given by 

i-l 

YgijHj. (4.3) 

j=0 

Now i {ai, bi] amounts to either i G [0, a/] or i G {bi,n — 1]. In the first case, the interval 
[0,i — 1] and the closed span [a/, bi] of gi are disjoint while in the second case the interval 
[0,i — 1] contains [0^,6/]. Hence in both cases the sum in (4.3) is zero. □ 

Now we are in a position to prove the non-mergeability of the BCJR-trellises. This result 
generalizes the well-known fact that the conventional BCJR-trellis is minimal (thus non- 
mergeable), see \12\ Sec. V]. It is worth noticing that, just like in the conventional case, the 
corollary is true even without assuming that the starting points ai, . . . , of the spans (or 
the endpoints bi, . . . ,bk) are distinct. 

Corollary 4.9 Any BCJR-trellis T := T(^g,h,s) non-mergeable. 

Proof: Let Vi be the i-th vertex space of T(^g,h,s)- Let us first suppose that the ver- 
tices V, v' G Vq are mergeable. Then the edge-label sequence of each path through T starting 
at V £ Vo and ending in v' G Vq must be a codeword in C (because after merging this path 
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would be a cycle) . But then by linearity of the trellis the edge-label sequence of each path 
through T starting at v — v' £ Vq and ending in S Vq must be a codeword in C as well. Due 



to Proposition |4.8[ b) we may apply Lemma 3.6 and conclude that such a path does indeed 
exist in T. But then Proposition 4.3 implies that this path is a cycle. Hence v — v' = 0. 
All this shows that no vertices in Vq are mergeable. Now the non-mergeability of vertices in 
any Vi follows from the shift property of T as described in Remark |4.7[ □ 



It should be noted that if the displacement matrix D is not as in Definition |4.5[ then the 
as in Theorem 



trellises T/ 



iG,H,D) 



4.1 



are, in general, mergeable even if the starting points 
(resp., end points) of the spans are distinct. An example is given in [15, Exa. 9, Fig. 14]. 
One can easily see how to merge that trellis. Moreover, that trellis has SCP (2, 2, 2, 2) and 
therefore Proposition |4.8[b) implies that the underlying displacement matrix is not as in 



Definition 4.5 and thus the trellis is not a BCJR-trellis in our sense. Furthermore, that trellis 



does not contain any paths connecting a nonzero vertex in Vq with G Vq and therefore the 
main argument of the proof above does not apply. 

At this point it should be mentioned that BCJR-trellises are in general not one-to-one, not 
even if the starting points (resp. endpoints) of the spans are distinct. This is illustrated by 
the following example taken from [15, Exa. 8]. 

Example 4.10 Let C = imG = kev , where 

G=(\ J J)gF2x3, and5=[(0,2], (l,0]],if=(l 1 l) . 
The BCJR-trellis T := Trc H s) ^^-s edge spaces Ei = im {Ni, Gi, Ni^i) given by 



En 



im 



E, 



im 



Eo 



im 



This shows that l-^^l-^l-^^l is a non-trivial cycle in T with the zero codeword as edge- 
label sequence; see also the figure below. Hence T is not one-to-one. Let us also consider the 
associated product trellis T = Tq s- In this case the edge spaces are given by 



En 



im 




1 



1 1 
1 



^1 



im 



1 




1 
1 



Eo 



im 



1 1 
1 




1 



From Theorem 3.3 we know that T is one-to-one; see figure below. However, the trellis T is 
mergeable at time i = 2 and indeed, merging the states (11), (00) and the states (10), (01) 
(that is, replacing V2 by the quotient space V2/ini(ll)) leads to the (not one-to-one) trel- 
lis T = T(^G,H,S)- the following graphs as well as in all later graphs a solid line denotes an 
edge with label 1, while dashed lines denote the label 0. 






00 



[Trellis t] 



[Trellis T] 



[Conventional Minimal Trellis] 
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The trellis T, being a BCJR-trellis, is non-mergeable. However, it is not minimal as the 
conventional minimal trellis of C at the very right shows. It should also be noted that T is 
not (isomorphic to) a KV-trellis. Indeed, as one can easily verify, the unique characteristic 
pair of C with span matrix S is given by 



^0 1 1 

(X, r) = ( ( 1 1 I , [(1,2], (0,1], (2,0] 
,1 1 



s 




From this it is obvious that the SCP (1,1,2) of T cannot be attained by a KV-trellis and 
neither can the SCP (1, 1, 1) of the trellis T. Moreover, the trellises T and T cannot be 
merged into a KV-trellis of the code. 
On the other hand, in Theorem ■ 



4.14 



we will see that the process of merging a product trellis T 
into the corresponding BCJR-trellis T is always possible (if T is mergeable at all) and can 
be achieved by taking a suitable quotient trellis. 



4.5 



Theorem 4.11 Let S := [(a/,5i], / = 1, . . . , A;] and T := T^g,h,s) Definition 
Let T be the product trellis T := Tq^s- Then the vertex spaces Vi := imA^j and Vi : 
im Mj , i E X, of r and T, respectively, satisfy 



dim Vi < dim Vi for i S X. 



(4.4) 



If equality holds true for all i ^ I in (4.4), that is, T and T have the same SCP, then one has 
the following. 

(a) T and T are isomorphic, thus the product trellis T is non-mergeable. 

(b) ai, . . . , Ofc are distinct and so are bi, . . . ,bk. Therefore, the formulas for the ECP and 
SCP given in Proposition \3.5\ apply. 



It is worth noting that T T with respect to the 0-ordering defined in [^l p. 2084]. While 
we have seen in Proposition |3 . 5| that if both oi, . . . , and hi, . . . ,bk are distinct, the formulas 
for the ECP do always apply to product trellises, this is not the case for the BCJR-trellis; 



see for instance Example 4.10 



Proof: First of all, by Proposition |4.8[ b) and the very definition of the matrices Mi in (3.4) 
we have 

col(A^i) C col(Mi), (4.5) 
where col(M) := {Mw \ w € F''} denotes the column space of the matrix M G F'^'^^ 



Hence 



rk(iVi) < rk(Mj) and this proves (4.4). 



Let us now assume that we have equality in (4.4) for all i G X. 
(a) The inclusions (4.5) imply 



col(iVi) = col(Mi) for i e X 



(4.6) 



and hence kerA^j = kerMj, where, as usual the kernel denotes the left kernel and thus is 
simply the orthogonal of the column space with respect to the standard inner product on F*^. 
As a consequence, the map : Vi — > Vi, aMi i — > aNi is a well-defined isomorphism. 
But then the maps (j)i give rise to an isomorphism between T and T because the edge spaces 
of T are given by Ei = im (A^,, Gj, iVj+i) whi le t hose of T are Ei = im (Mj, Gj, Mj+i). The 
non-mergeability of T follows from Corollary 4.9 
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(b) Let the index sets A be as in ( |3.5[ ) again. Then rk (Mi) 
by assumption rk(A'j) = rk(Mj) Proposition |4.8[b) implies 



|£i| by Proposition 3.5 Since 



the rows of Ni with indices from £j are hnearly independent. 



(4.7) 



Assume first that for some I ^ m we have a/ = am- Put z = a/ + l = am + l. Then the l-th 
row of Ni is given by 



n-1 



i-1 



3=0-1 j=0 

and Hkewise the m-th row of Ni is 



n-1 

E 

j=am 



i-1 



9m j Hj + ^ ^ 5'mjr -^jr — 9mai H, 
j=0 



(4.8) 



Hence these two rows are hnearly dependent. Observe that if ai ^ hi and ^ bm then 
I, m G Ci and therefore this hnear dependence contradicts (4.7). If ai = bi = am ^ bm, then 
{ai, hi] = and m £ Ci, but I ^ Ci. However, in this case the row gi is nonzero only at the a/-th 
position and this forces = 0. Again, (4.8) contradicts (4.7). The case bi ^ ai = am = bm 
is symmetric and the case ai = bi = am = bm cannot occur due to the linear independence of 
the rows gi, gm- 

Finally assume that for some I ^ m we have bi = bm- Then for i = bi = bm the l-th. row of Ni 



is Ej=i 9ijHj + Ej=o 9ijHj 
we may argue as in the first case. 



-gib^Hin and likewise the m-th row of Ni is —gmbiHbi- Now 

□ 



In the proof we used the fact that if equality holds in (4.4) then we have (4.7). With the aid 
of Proposition |4.8[ b) this yields 

keriVi = {a G I a/ = for / G (4.9) 

Now we are in a position to prove that KV-trellises are BCJR-trellises. 



fcxn 



Theorem 4.12 Let (X, T) he a characteristic pair of C as in Definition 3.7 Let G G F 
consist of k linearly independent rows of X and let S be the list of corresponding spans taken 
from T- Then the trellises Tq^s ^.nd T(^g,h,s) isomorphic. As a consequence, KV-trellises 
are BCJR-trellises and thus non-mergeahle. 



Proof: The last statement follows from Corollary 4.9, For the first part it suffices to prove 
that rk (Mi) = rk (N) for all i £ I. Indeed, then the result is immediate with Theorem 4.11 
We will proceed in several steps. 



1) By Definition |3.7[ iv) we may assume without loss of generality that the first k rows of X 
have linear span (thus ai < hi for I = 1, . . . , /c) and the last n — k rows have circular spans. 
Then those first k rows xi, . . . , j;^ are linearly independent and form an MSGM of C, see \\.2\ 
Def. 6.2, Thm. 6.11]. Define the matrix Z G F^'""'')^ via 



Zl 



\ 



Z 



where zi 



n-1 

E 

j=a,k+l 



Xk+i,jHj 



(4.10) 



(where Xk+i 
gular. 



{xk+ifl, . ■ ■ , Xk+i^n-i) is the {k + /)-th row of X). We claim that Z is nonsin- 
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In order to see this, assume PZ = for some /3 = . . . , fin-k) S F" . Hence 

n—k n—1 



0. 



(4.11) 



1=1 



For I = 1,. . . ,n- k define Xk+i := (0, . . . , 0, Xk+i,ak+i j • • • > e F", that is, Xk+i is the 

vector with first a^+i entries equal to zero and the last ones identical to those of x^+i- Notice 
that by definition of the span of a vector Xk+i^a^+i 7^ ^-iid remember that a^+i, . . . ,an are 
distinct by definition of a characteristic pair. Therefore, x^+i, . . . ,Xn are linearly independent. 



Moreover, (4.11) implies 



n—k n—1 



^ A ^ Xk+LjHj 



n—1 n—k 



E 



1=1 



j=0 



/ ^ PiXk+i,j)Hj. 

j=0 1=1 
n—k 



But this means that x G keri/'' = C, where x := ^]^Zi Pi^k+i- If we can show that 5; = 0, 
then the linear independence of ifc+i, ■ ■ ■ ,Xn implies (3 = 0, which in turn proves that Z is 
nonsingular. We proceed by contradiction and assume x ^ 0. Then x has a unique linear 
span, say (a, b], where, by definition of x, 



a = min{aA:+; | A 7^ 0}. 



(4.12) 



On the other hand, the first k rows of X form an MSGM of C and therefore the codeword x is 
of the form x = X]f=i lixi for some 7; G F. Now the predictable span property of an MSGM 
\\.2\ Lem. 6.6] implies that a 
that oi, . . . 



minja^ | 7/ 7^ 0}. Along with (4.12) this contradicts the fact 
are distinct. Thus 5; = as desired. 



2) Consider now the matrix D as defined in (4.2). The rows of D corresponding to linear 
spans are zero (see Proposition 4.8), while the other rows, corresponding to circular spans. 



are certain rows of the matrix Z in (4.10). Hence the nonsingularity of Z implies that those 
rows are linearly independent. All this shows that rk {D) = rk (A'^o) = |{^ I [o-h bi] circular}] = 
|£o| = rk (Mq), by virtue of Proposition |3.5[ 

3) From 2) we have rk (Aq) = rk (Mq). Now the identities rk (Aj) = rk {Mi) for general z G X 
follow from applying the shift a and making use of Remarks 3.4, 3.8[b), and 4.7 □ 



Theorem 4.12 shows that the set of KV-trellises is a subset of the set of (one-to-one) BCJR- 
trellises. The following example illustrates that this containment is proper, that is, the set 
of one-to-one BCJR-trellises is strictly larger than the set of KV-trellises. As a consequence, 
the class of non-mergeable one-to-one trellises is strictly larger than the class of KV-trellises. 
At the end of this section we will sketch a proof showing that the class of non-mergeable 
(one-to-one) trellises coincides with the class of (one-to-one) BCJR-trellises. 

Example 4.13 Consider 

/O 1 1 1 0\ 

G=\l 1 G F^''^ with span list 5 = [(1,3], (3,0], (2,1]]. 

Vo 1 1 1/ 

Since the rows of G are linearly independent, the product trellis T := Tq^s is one-to-one. 
Moreover, it has vertex spaces Vi = imMj, where 



^0 0> 
Mo = I 1 
.0 1> 



Ml 
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Hence the SCP of T is given by s = (2, 1, 1,2,2). One can straightforwardly verify that a 
characteristic pair of the code C = im G C is given by (X, T) with span matrix S, where 



X 



/I 1 
1 



1 

Vo 



1 
1 





1 1 





1 
1 

0/ 



T= [(0,2], (1,3], (3,4], (4,0], (2,1]], 5 



/O 1 1 0\ 

110 

1 

1 

Vi 1 1 1/ 



Using Remark 3.11 one can see directly that no three rows of X and their according spans 
will lead to the SCP above. Hence T is not isomorphic to a KV-trellis. Let us use the parity 
check matrix 

'10 111' 
,01100, 



H 



in order to compute the BCJR-trellis T(^g.h,s) of C. Applying (4.2) and (4.1) leads to 







Na 




Hence col{Ni) = col(Mj) for all i G {0, ...,4} and due to Theorem 4.11 the trellis T is 
isomorphic to the BCJR-trellis T(^g,h,s) as a consequence, T is non-mergeable. This 

can also be seen directly from the concrete trellis T^g,h,s) the figure below. Let us also 



briefly verify the statements about the ECP as given in Proposition 3.5 The edge spaces Ei 
are given by Ei = im {Ni, Gi, Ni^i), where Gi is the i-th. column of G. For sake of space we 
display the matrix (A^olG'ol-Ail • • • I-/V4IG4I A'o). It is given by 

















1 





1 


1 


1 





1 





















1 


























1 


1 








1 






1 








1 


1 








1 


1 


1 





1 


1 


1 








and shows that the ECP is given by e = (2, 2, 2, 3, 2). Hence e = s + (0, 1, 1, 1, 0) and the 
latter is indeed the indicator function of the set ^ = {1, 2, 3} of starting points of the spans. 
Likewise, we have e = (1, 1,2,2,2) + (1, 1,0, 1,0) and the latter is the indicator function of 
B = {0, 1,3}. All this illustrates the formulas given in Proposition 3.5 

Finally, from the graph below it can easily be checked that the dual trellis T^jj^ q , repre- 
senting C"*" = '\mH (see Proposition 4.4), is mergeable at time z = 0. As a consequence, the 
dual of a BCJR-trellis need not be a BCJR-trellis. Even more, the dual graph is disconnected 
and there does not exist a path from (Oil) G imA^J" to (000) G imA^J". Hence the dual trellis 



does not satisfy Lemma 3.6 





[Trellis T(G,H,S)1 



[Trellis T 



{H,G,N')' 
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In \15\ Thm. 3.1] it has been shown that a non-mergeable product trelhs is isomorphic to 
the corresponding BCJR-trellis. The following theorem generalizes this result. It shows that 
every (one-to-one) product trellis Tg,s can be merged to the corresponding BCJR-trellis and 
merging is accomplished by taking suitable quotients of the vertex spaces. An instance of 
this process has been given in Example 4.10 It also reminds us to be aware of the fact that 



the resulting BCJR-trellis may not be one-to-one. 



Theorem 4.14 Let S be as in (3.3) and consider the trellises T(^g,h,s) Tg,s along with 
the matrices Ni and Mi as in (4.2)/(4.1) and (3.4), respectively. Then 

(a) there exist suhspaces Wi C imMj such that kerMj © Wi = kevNi, 



(b) the quotient trellis T' ofT^s defined as T' = {V',E'), where V = U^JoV/ and E' 

.n-l : 



Urrn -E' with 



Vl = -iiaMi/Wi, E[ = {{aMi + Wi, aGi,aM,+i + Wi+i) \ a € F'^}, 
is isomorphic to the BCJR-trellis T(^g,h,s)- particular, C{T') = C. 



Proof: (a) First notice that by the very definition of Mi in (3.4) along with Proposi- 
tion [Xsjjb) we have MiNi = Ni and thus kerMj C kerA'j. Hence there exists a subspace 
Wl C F'' such that kerMj © = ker A^j. Define the subspace Wi := W^Mi := {aMi \ 
a £ VFj'} C imMj. Notice that (Mj)^ = Mj, thus Mj defines a projection and hence 
kerMj © imMj = F''. This implies kerMj n Wi = {0}. The inclusion Wi C ker Aj fol- 
lows immediately from the identity M^Ni = Aj along with the fact that Wl C ker Aj. Now 
let P G ker Ni. Then there exist a G kerMj and 7 G W- such that /3 = a + 7. But then 
/3 = a + 7(1 - Mj) + 7Mj and 7Mj G Wi while a + 7(1 - Mj) G ker Mj due to (/ - Mj)Mj = 0. 
All this proves (a). 

(b) First of all, by virtue of (a) we have dimT^' = dim(im Mj) — dim^Vj = k — (dim kerMj + 
dim Wj) = k — dimker A^j = rk Aj. Moreover, it is easy to see that 



imAj, aMj + Wj 



aA-j 



is a well-defined, surjective homomorphism and thus an isomorphism between the vertex 
spaces Vl of T' and im Aj of T^g,h,S) ■ But then it is clear from the very definition of the edge 
spaces E'- above and Ei as in Theorem 4.1 that T' and T(^g,h,s) isomorphic as defined 
in (e) in the introduction. □ 

We close this section with the following observation. 

Remark 4.15 In this remark we sketch a proof showing that the class of lineai|^ non- 
mergeable (not necessarily one-to-one) trellises coincides with the class of BCJR-trellises as 
in Definition |4.5[ but where the rows of G maybe linearly dependent. 
First of all, let G G F'^^" where rkG < fc. It not hard to show that Theorem 



3.3 



remains 



valid (with the exception that Tg^s is not one-to-one anymore) and that BCJR-trellises as 
in Theorem |4.1| and 4.5 can be defined in the same way. The resulting trellis T^g, h,s) still 



4.3 



4.8 



represents C = imG = ker//*". Moreover, one can easily verify that Propositions 
Corollary 4.9 Theorem [4.11 and Theorem 4.14 remain valid with the same proofs (with the 
exception of part 4.11[ b) ; see also the footnote on Page [?]) . For sake of clarity let us call the 
resulting BCJR-trellises genBCJR-trellises. 

Now the above implies in particular that every genBCJR-trellis is non-mergeable. For the 



Recall from Definition 2.1 a) that linear trellises are reduced. 
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converse, let T be a linear and non-mergeable trellis of C. By [3 Thm. 4.2] T is the product 
of elementary trellises T|r;,{a;,fe;]) ^ = 1, • • ■ , Jtt-, where {2:1, . . . , Xm} is a (not necessarily linearly 
independent) generating set of C and where (a^, 6;] is a generalized span of xu that is, x^j = 



for j [a/, bi] (notice the difference to our definition of a span in Def. |3.1| ). By [9l Lem. 4.3] 
each factor T^^^i^aiM] non-mergeable. But then it is easy to see that {ai,bi] is a span of xi 
in the sense of Def. 3.1 (that is, xi is nonzero at the endpoints ai and hi) for otherwise the 
elementary trellis could be merged at time a/ or bi. Hence T = Tq s in the sense of Def. 3.10 



but where the rows of G are possibly linearly dependent. By Theorem 4.14 T can be merged 
to the corresponding genBCJR-trellis, hence, by non-mergeability, T is a genBCJR-trellis. 
As a consequence, if T is one-to-one, then T is a BCJR-trellis. 



5 On a Duality Conjecture by Koetter/Vardy 



Koetter/Vardy have shown the following. 

Lemma 5.1 ([9], Lem. 5.11]) Let C C and both be codes with support I and let the 
characteristic span Ust of C be given by T = [(a/, bi], I = 1, . . . , n]. Then the characteristic 
span hst ofC^ is given by [(6/, a;], / = 1, . . . , n]. 

Moreover, in [9l Prop. 5.13] they proved the following: select k linearly independent rows of a 
characteristic matrix of C, with spans, say, (a;., 6/.], i = 1, . . . , A;, and construct the resulting 
KV-trellis T. Pick the n — k rows of the characteristic matrix of C"*" that do not have spans 
(bi-, ai-], i = 1, . . . ,k and construct the resulting product trellis T. Then the trellises T and T 
share the same SC P. Ho wever, while the trellis T for C is, by construction, a KV-trellis in the 
sense of Definition 3.10 the trellis T might not even represent C"*" because it is not a priori 
clear whether the underlying n — k rows are linearly independent. Koetter/Vardy conjecture 
that those rows are indeed always linearly independent and thus the trellis T is a KV-trellis 
for C"*". For all this one has to have in mind that Koetter/Vardy define the characteristic 
matrix in a way which makes it unique for a given code and their conjecture makes sense as 
phrased. However, in our definition a characteristic matrix is not unique and therefore the 
linear independence of the selected rows might even depend on the choice of that matrix. 
This is indeed the case as the following example shows. 



Example 5.2 Consider again Example |3.12[ The underlying binary code and its dual are 
C = {0000, 1100, 0111, 1011} and = {0000, 1101, 0011, 1110}. A characteristic pair for 
the dual code is given by 



A 


1 





A 


1 


1 





1 


1 


1 


1 





Vo 





1 


1/ 



[(1,0], (3,1], (0,2], (2,3] 



In Example 3.12 we observed that the last two rows of X are linearly independent and thus 
give rise to a KV-trellis of C The according spans are (2, 0] and (3, 2]. Hence the complemen- 
tary rows for C"*" are the first two rows of X, but they are linearly dependent. Replacing the 
first row of X with the codeword (1, 1, 1, 0) G C"*- and keeping the span (1,0], one obtains an- 
other characteristic matrix for for which that row selection is indeed linearly independent. 
All this shows that the conjecture of Koetter/Vardy does depend on the choice of the charac- 
teristic matrix. One might also want to observe that both X and X are the lexicographically 
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first characteristic matrix of their respective code and therefore the ones Koetter/Vardy sin- 
gled out in their approach. Thus the example also shows that their conjecture does not hold 
for this particular choice. 

For the specific classes of self-dual codes and cyclic codes the Koetter/Vardy conjecture has 
been proven in [7J. 

In the sequel we will show that for row selections leading to minimal KV-trellises (in the 
sense of Definition 2.2 a)) there always exists a characteristic matrix of C"*" such that Koet- 



ter/Vardy's conjecture holds true. 

Theorem 5.3 Let C, C F" both have support I. Furthermore, let (X, T) be a character- 
istic pair of C, where T = {{ai, 6;], / = 1, . . . , n]. Suppose the rows of X with indices li, . . . ,1^ 
are hnearly independent and give rise to a minimal KV-trellis. Then there exists a character- 
istic matrix XofC^ such that the n — k rows of X that do not have spans (6/ . , a/ . ] , i = 1 . . . , A;, 
are linearly independent and give rise to a minimal KV-trellis ofC^. 

In the proof we will use a dualization technique for trellises going back to Mittelholzer |13| 
and Forney [5, Sec. VII. D and Thm. 8.4]. Their construction of a dual trellis amounts to 
dualizing the edge spaces (called local constraints in [5j ) with respect to a particular bilinear 
form. In our case this bilinear form reads as in ( |5.2[ ) below. As the proof will show for 
minimal BCJR-trellises the resulting dual trellis coincides with the BCJR-dual defined in 



Proposition 4.4 



Proof: Without loss of generality let us assume {li,. . . ,lk) = (1, . . . , A;) and let G G F'^^" 
be the matrix consisting of the first k rows of X. Then G is a generator matrix of C. 



As before, let the parity check matrix H be as in (2.2). By Theorem 4.12 we know that 
the corresponding KV-trellis Tg,s is isomorphic to the BCJR-trellis T := T(^g,h,s)j where 
S = [{ai, bi],l = 1, . . . , A;] . Denote the sets of starting and end points by ^ = {ai, . . . , ak} 
and B = The vertex and edge spaces of T are given by imA^j and Ei := 

im (Aj, Gj, Aj+i), i e I, where Aj is as in (4.1), (4.2). Let Sj := rk(Aj), thus {sq, . . . ,Sn-i) 
is the SCP of T. 



By Proposition 



4.4 



the BCJE-trehis T 



'^(HGN^) represents C and has the same SCP. 
Its vertex and edge spaces are given by im Aj and Ei := im (Aj, i^/, Aj+i), where Aj = Nj^ . 
Since T is a mini mal t rellis the same is true for T; see also [15^ Thm. 4.1]. Therefore, by 
virtue of Theorem 



3.14 



the trellis T must be a KV-trellis for C . This means, there exists a 
characteristic pair {X, T) of such that T is a KV^-^ ^^-trellis, which in turn implies that 

there exists a selection of n — /c rows of X giving rise to the trellis T. Let S be the list of 
associated characteristic spans and let A be the set of starting points of 5. Thus, |^| = n — k. 

By virtue of Lemma 5.1 we have to show that A = {hk+i, • • • , ^n}- From Theorem 



4.11 



and 



Proposition 3.5 we know 



Ci := dimE'j = Si + if' and ej := dimE'j = Sj+i + If, i £ I. 



(5.1) 



Using that Aj = A7 one obtains the following well-defined non-degenerate bilinear form 



11: (imAj X F X imA,,+i) x (im Aj x F x im Aj+i) 



F 



+ ab- aAi+i/3^. 



(5.2) 
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Then we have for ah a €¥'^ and /3 G F"~^ and i £ I 

n((aiV„ aGi,aNi+i), {pNi^^H^, = aNip^ + aGiHi/3^ - aNi+ip^ 

= a{Ni + GiH,-Ni+i)/3^ = 0, 



where the last identity fohows from the very definition of Ni^i in (4.1). Hence Ei C {Ei)°, 
where {Ei)° denotes the dual space of Ei in im A^j x F x im A^j+i with respect to the bilinear 
form n. Using that dim(imiVi x F x imiVj+i) = dim(imA^j x F x imA^j+i) = Sj + Sj+i + 1 
and applying the second identity in (|5.1|), we conclude 



ii = dim < dim(£'j)° = Sj + Sj+i + 1 - Cj 



1 



Along with the first identity in (5.1) this yields 



I^ + lf <1 for alH G X 



(5.3) 



(5.4) 



for the indicator functions on A and B. Hence the sets A and B are disjoint and since 
the two sets are subsets of I of cardinality n — k and k, respectively, we may conclude 
A = I\B = {^fc+ii • • • ; bn} as desired. This completes the proof. 



One should notice that we now also have equality in (5.4) and (5.3) and thus derived the 
identity Ei = {Ei)°, which is an instance of [5, Cor. 8.2]. □ 

It is worth noting that the proof does not explicitly require the minimality of the trellis, but 



only that the dual trellis T = T, 



be a KV-trellis. We strongly believe that Theorem 



5.3 



is true for all KV-trellises (and not just minimal ones). Moreover, we believe that the BCJK 
dual of a KV-trellis is isomorphic to the trellis obtained from the dual row selection of a 
suitably chosen dual characteristic matrix. 

Remark 5.4 The proof above shows that for KV-trellises whose BCJR-dual is a KV-trellis 
again, this BCJR-dual coincides with the dual obtained by dualizing the edge spaces. In other 
words, Ei = {Ei)° for z G X in the notation of the proof above. We wish to point out that 
this is not always the case for BCJR-trellises, not even if they are one-to-one and isomorphic 
to the corresponding product trellis (recall that BCJR-trellises are always mergeable). An 

The trellis Tg,s = T(^G,H,S) has SCP (2, 1, 1, 2, 2) 



4.13 



example can be drawn from Example 
and ECP (2,2,2,3,2). The dual trellis T = T^^hgnJ) course, the same SCP and 



its ECP is (2, 1,2,2,2). According to (5.3) the trellis T° with vertex spaces imA^j and edge 
spaces {EiY has ECP (2, 1,2,2,3), hence E4 C (£4)° and Ei = {Ei)° for i = 0, . . . , 3. One 
computes 



G imiV4 X F X imiVo, 







1 


1 


1 





1 




{E^r = im 








1 















Vo 








1 











which results in the following trellis 




000 000 000 000 000 000 



[Trellis T° 
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As one can easily see, this trellis does indeed represent C- 



is one-to-one, but it is not reduced! 

Removing the 4 



As a consequence, it is not considered a linear trellis due to Definition 2.1 
cross-diagonal edges in (£4)° results in the BCJR-dual T^j^ q displayed in Example 



4.13 



Some Open Problems 



In Theorem 3.14 we have seen that every minimal trellis is a KV(x,t) "trellis for a particular 
choice of the characteristic pair {X, T). Example 3.15 showed that indeed a suitable choice 
of the characteristic matrix is needed for this result to be true. In that particular example 
a different choice of the characteristic matrix leads to a trellis that is (only) structurally 
isomorphic to the given trellis. All this gives rise to the question whether every minimal 
trellis arises, up to structural isomorphism, from a fixed characteristic matrix? It is 
easy to see that this amounts to investigating whether if a certain row selection of one 
characteristic matrix is linearly independent and gives rise to a minimal trellis of C, then 
the same row selection is linearly independent for each characteristic matrix of the code. 



If this turns out to be true then this implies that in Theorem 5.3 the statement would be 
true for each characteristic matrix of the dual code. 

Does Theorem 15 . 31 hold true for all KV-trellises? This would be answered in the affirmative 
if one can show that the BCJR-dual of a KV-trellis is a KV-trellis again. 



From Examples 3.12/5.2 it is clear that the duality result of Theorem 5.3 does indeed 



depend on the choice of the characteristic matrix of the dual code (at least, if the trellises 
are not minimal). However, several examples suggest that one may ask whether there 
exist a dual pairing of characteristic matrices X and X for C and C"*", respectively, such 
that Koetter/Vardy's duality conjecture holds true for these pairs, that is, such that the 
Koetter /Vardy-dual of a KV(x,t) "trellis ^ ^^(xf)' 
pies 



trellis. For instance, for Exam- 



3.12/5.2 



this is true for the pairing (X', X) and when pairing X with the (unique) 
remaining characteristic matrix of C^. 
Compare the BCJR-dualization with the dualization of trellises via dualizing the edge 
spaces as in [131 IS]- More specifically, under which conditions do these dual trellises 



coincide when using the bilinear form (5.2) 
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